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SYSTEM AND METHOD FOR EXPRESSION PROTEOMICS BASED ON ISOTOPE 

RATIO MODIFICATION 

5 

FIELD OF THE INVENTION 

The invention relates to expression proteomics, and particularly, to a system and method 
for analyzing expression proteomics based on a modification or modifications to isotope ratio. 

1 o BACKGROUND OF THE INVENTION 

Proteomics seeks to monitor the flux of protein through a biological system under 
variable developmental and environmental influences as programmed by the genome 
(Whitelegge, "Plant proteomics: BLASTing out of aMudPIT," Proceedings of the National 
Academy of Sciences of the United States of America, Session 99, pp. 11564-11566 (2002)). 

15 Consequently, it is necessary to measure changes in protein abundance and turnover rate as 
faithfully as possible. In the absence of non-invasive technologies, the majority of proteomics 
approaches involve destructive sampling at various time points to obtain 'snapshots' that 
periodically report the genome's product. Thus, quantitation has become the major challenge 
facing the field as it matures. Because of the variability of day-to-day measurements of protein 

20 quantities, a common feature of quantitative proteomics is the use of stable isotope coding to 
distinguish control and experimental samples in a mixture that can be profiled in a single 
experiment (Gygi et al , "Quantitative analysis of complex protein mixtures using isotope-coded 
affinity tags," Nature Biotechnology, Vol. 17, pp. 994-999 (1999); Conrads et al, "Current 
strategies for quantitative proteomics," Advances in Protein Chemistry, Vol. 65, pp. 133-159 

25 (2003)). Isotope coding was introduced to allow the distinction between two samples in a single 
mass spectrometry experiment in order to achieve improved relative quantitation since run-to- 
run variability is eliminated (Tao and Aebersold, "Advances in quantitative proteomics via 
stable isotope tagging and mass spectrometry.," Current Opinions in Biotechnology, Vol. 14, pp. 
110-118 (2003)). Both chemical modification (Gygi et al, 1999) and in vivo labeling 

30 approaches (Pasa-Tolic et al, "High throughput proteome-wide precision measurements of 
protein expression using mass spectrometry," Journal of the American Chemical Society, Vol. 
121, pp. 7949-7950 (1999)) have been successful. In vivo pulse-chase experiments with stable 
isotopes allow for measurement of protein turnover rates (Pratt et al. , "Dynamics of protein 
turnover, a missing dimension in proteomics," Molecular and Cellular Proteomics, Vol. 1, pp. 
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579-591 (2002); Whitelegge et al, "Mass Spectrometry of the Lactose Permease," 
Transmembrane Transporters, Receptor Biochemsitry and Methodology Series, M. Quick, ed. 
pp. 179-190 (2002)). Coding with stable isotopes can be achieved by growth of an organism in 
depleted/enriched media or by chemically modifying proteins after extraction from the 
organism. There are a number of stable isotope strategies currently used to study biological 
systems. Each of these strategies is predicated on the use of a complete or near-complete 
isotope swap in the molecules, sample or entire organism being analyzed. This process is 
sometimes referred to as "conversion" with reference to the molecules, sample, or entire 
organism. 

In general, a non-natural isotope is "swapped" for the naturally occurring variety. For 
instance, 15 N may be swapped for I4 N, 13 C for 12 C, or ls O for 16 0, such that a peptide's mass is 
altered by several Daltons. A complete (or near-complete) isotope swap may be achieved by 
any number of strategies well known to those of skill in the art. By way of example, these 
strategies may include isotope-coded affinity tag ("ICAT") technology, stable isotope labeling 
with amino acids in cell culture ("SILAC"), enzymatic molecular introduction (i.e., exchange) of 
O, and growth of a biological sample, a cell culture, or an entire organism in stable isotopes. 
By inducing an isotope swap of this nature, it becomes possible to examine a range of biological 
processes at the molecular level. 

After the development of isotope ratio mass spectrometers, it was shown that C was 
naturally depleted in plants and that the extent of depletion varied among plants (Nier and 
Gulbransen, "Variations in the relative abundance of the carbon isotopes," Journal of the 
American Chemical Society, Vol. 61, pp. 697-698 (1939); Murphy and Nier, "Variations in the 
relative abundance of the carbon isotopes," Physical Review, Vol. 59, pp. 771-772 (1941)). 13 C 
isotope depletion measurements were integral to the discovery of C4 photosynthesis and 
crassulacean acid metabolism CAM; C3 plants are C depleted by about 30% compared to 10- 
15% in C4 and 10-25% for CAM plants. There are several origins of 13 C depletion including 
thermodynamic fractionation by the carboxylation activity of ribulose-bisphosphate 
carboxylase/oxygenase (RUBISCO, 29% depletion) and various physical contributions related 
to solubility, diffusion and hydration of CO2 in water combined with biological bias from 
enzymes that impinge upon these properties. Furthermore, while isotope fractionation during 
small molecule flux is quite well understood, less is known about the process with respect to 
protein synthesis. Recent literature has reported minor variations in isotope ratio for different 
amino acids from specific protein sources as well as positional effects within amino acids (Jim et 

a!., "Effects of hydrolysis on the delta 13 C values of individual amino acids derived from 
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polypeptides and proteins," Rapid Communications in Mass Spectrometry, Vol. 17, pp. 2283- 
2289 (2003); Sacks and Brenna, "High-precision position-specific isotope analysis of 13C/12C 
in leucine and methionine analogues," Analytical Chemistry, Vol. 75, pp. 5495-5503 (2003)). 

18* 

O bias has been reported in plants and has the potential to interfere with calculation of accurate 
13 C/ 12 C ratios when the second " 13 C" isotope peak is included in calculation of isotope ratio 
(Schmidt et at, " ls O pattern and biosynthesis of natural plant products," Phytochemistry, Vol. 
58, pp. 9-32 (2001)). Minor contributions from D/H and 15 N / 14 N fractionation might also 
contribute minor variability. 

While isotope swapping technology has important implications and uses in the study of 
molecular biology, it is significantly limited in its applications. A number of disadvantages have 
emerged, including isotope effects upon retention during liquid chromatography (Zhang et ah, 
"Fractionation of isotopically labeled peptides in quantitative proteomics," Analytical 
Chemistry, Vol. 73, pp. 5142-5149 (2001)), loss of separation space in the mass spectrometer 
and expense. Furthermore, it is generally not applicable, for example, to the study of living 
humans and other animals. Although isotopically modified animals have been produced (Wu et 
aL 9 "Metabolic Labeling of Mammalian Organisms with Stable Isotopes for Quantitative 
Proteomic Analysis," Analytical Chemistry, Vol. 76, Issue 17, pp. 4951-9 (2004)), inducing a 
complete swap of isotopes in living humans - even if it could be achieved, which is by no means 
a simple assumption - is not likely to be a viable option for a number of reasons including 
significant health concerns and the tremendous expense and time required to effect such a 
change. Unlike the labeling of bacteria and plants, in which the entirety of the carbon content 
can be derived from C0 2 with concomitant isotopic labeling of all the polypeptides, the labeling 
of the polypeptides of animals is not as simple. The most straightforward way to manipulate the 
carbon isotopic ratio in animals would be by incorporating 13 C-enriched amino acids into the 
diet. Unfortunately, because of the differential flux of free amino acid pools, it is quite possible 
that polypeptides will not be homogeneously labeled. For example, the concentrations of the 
branched chain, aromatic and sulfur containing amino acids in humans are only minimally 
affected by dietary supplement (Millward and Rivers, "The nutritional role of indispensible 

i 

amino acids and the metabolic basis for their requirements," European Journal of Clinical 
Nutrition, Vol. 42, pp. 367-393 (1988)). A more direct comparison was done that shows that 
supplementation by both leucine and lysine at similar concentrations leads to a 2-fold increase in 
lysine concentration over that of leucine even though both amino acids are found in similar 
abundance in proteins and would therefore be expected to have the same metabolic flux 
(Bergstrom et al, "Effect of a test meal, without and with protein, on muscle and plasma free 
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amino acids/ 5 Clinical Science, Vol. 79, pp. 331-337 (1990)). Furthermore, performing 
polypeptide analysis based on partial 13 C labeling via dietary amino acids is complicated by the 
many amino acids that are biochemically close to core metabolic pathways that would be 
expected to undergo rapid isotopic shuffling. For example, alanine and glutamate can undergo a 
5 transamination to become pyruvate and ketoglutarate, respectively; in a study of the 
incorporation of dietary labeled amino acids in chicken feed into egg protein only 11% of the 
alanine and 7% of the glutamate were incorporated without metabolic transformation (Berthold 
et al, "Uniformly I3 C-labeled algal protein used to determine amino acid essentiality in vivo," 
Proceedings of the National Academy of Sciences of the United States of America, Session 88, 

10 pp. 8091-8095 (1991)). Additionally, as might be expected, methionine partially behaved as an 
amino acid containing one less carbon due to methionine's role in methyl transfer biochemistry. 
These differences in flux suggest that studies in higher eukaryotes might be best performed by 
labeling the amino acids that show the lowest turnover. 

These methods are expensive because of the need for high isotope purity. Furthermore, 

15 the fact that two peptide isotope distributions replace one leads to a practical loss of separation 
space in the mass spectrometer demanding more efficient peptide separations. Moreover, the 
second isotope distribution may trigger MSMS in proteomics experiments wasting mass 
spectrometer time. Thus, in order to implement isotope swapping techniques in the study of 
such systems, a different methodology is believed to be required. Such a methodology may 

20 have a dramatic impact on the study of biology and biological systems, and particularly human 
and animal biology, as well as other systems that may not be appropriate for study in connection 
with the conventional methods described above. 



SUMMARY OF THE INVENTION 

25 Described herein are a system and method for subtly modifying isotope distribution for 

expression proteomics. The system and method include administering to an organism a 
composition with a modified isotope profile. The invention further includes monitoring the 
turnover of identified peptide sequences. 

In another embodiment of the present invention, a process for decoding isotope 

30 distribution is provided. In various embodiments of the present invention, the isotope ratio of 
particular peptides or polypeptides in an organism is modified. Various molecules may be 
modified 13 C for I2 C, I8 0 for 16 0, 15 N for 14 N and the like. The isotope ratio of 13 C: 12 C may, for 
example, be modified from a ratio of 100:1 to 200:1. The isotope modification of the present 
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invention may be implemented by various means including, but not limited to, ICAT, SILAC, 
enzymatic exchange, growth in stable isotopes, diet and injection. 

BRIEF DESCRIPTION OF FIGURES 
5 FIGURE 1 depicts spectrographs of altered isotope profiles of peptides from cells grown 

in media with altered CI C ratio in accordance with an embodiment of the present invention. 
The spectra are MALDI-TOF spectra of a tryptic peptide from phycocyanin A obtained from 

■I <2 

cultures of Synechocystis sp. PCC 6803 grown under elevated C. The spectra show increased 
abundance of isotopic species containing C. 

10 FIGURE 2 depicts spectrographs of altered isotope profiles of peptides from cells grown 

in media with altered CI C ratio in accordance with an embodiment of the present invention. 
The ion-trap ESI spectra were recorded on an ion-trap mass spectrometer showing spectra of the 
same peptide as in Figure 1. The spectra are single scans from a 'triple-play' protein 
identification experiment. The relative abundance of the isotopomers yielding 13 C/ 12 C ratio 

1 5 potentially provides relative expression measurement in parallel with protein identification. 

FIGURE 3 demonstrates performance of protein identification by tandem mass 
spectrometry under conditions of modified 13 C/ 12 C isotope ratio in accordance with an 
embodiment of the present invention. Tandem mass spectra from the 'triple-play' experiment 
shown in Figure 2 are shown for control (a) and 3% samples (b), both derived from the doubly 

20 charged parent. Figure 3 shows a notable shift of higher mass b and y fragments in (b). 

1 "X 

Increased abundance of C isotope containing species did not affect protein identification. 
Performance was compromised in the 6% sample (not shown). Successful peptide identification 
allows the use of elemental composition in calculation of CI C ratio. 

25 DESCRIPTION OF THE INVENTION 

The present invention is based on a novel approach to the study of molecular biology, by 
inducing subtle modifications in the isotope ratio of particular peptides and polypeptides for 
expression proteomics. "Subtle," as used herein with reference to the modification of isotopes 
included in target molecules, is defined as a "swapping" of, on average, an amount of isotopes 
30 included in target molecules such that there is a measurable effect upon the observed peptide 
isotope distribution, without causing a gross extension or displacement of the single isotope 
envelope. The modification introduced is gross compared to natural isotopic variability yet 
subtle compared with strategies that seek full exchange. Isotope ratio is calculated for specific 
peptides or polypeptides based upon their isotopic distributions obtained by high-resolution 
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mass spectrometry. This requires either an estimate of elemental composition based upon mass 
and average amino acid elemental composition (averagine), or the precise elemental 
composition based upon the peptide sequence determined by tandem mass spectrometry and 
protein identification algorithms, as in typical proteomics experiments. Successful 
implementation of the present invention was demonstrated by modest elevation of the 13 C/ 12 C 
ratio in Synechocystis sp. PCC 6803 cultures as a model for other organisms. Small fluctuations 
of isotope ratios occur in living organisms as a result of metabolic bias and nutrition (Meier- 
Augenstein et aL, "Applied gas chromatography coupled to isotope ratio mass spectrometry/ 5 
Journal of Chromatography, Vol. 842, pp. 351-371 (1999)). In this model, carbon was supplied 
as C0 2 /HC0 3 (from bicarbonate) and entered the metabolic cycle via carbon fixation. 

Unless defined otherwise, technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Singleton et aL, Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley 
8c Sons (New York, NY 1994), provides one skilled in the art with a general guide to many of 
the terms used in the present application. 

One skilled in the art will recognize many methods and materials similar or equivalent to 
those described herein, which could be used in the practice of the present invention. Indeed, the 
present invention is in no way limited to the methods and materials described. Alternate 
methodologies and procedures may be readily implemented without undue experimentation. 
The presently disclosed embodiments are therefore to be considered in all respects as illustrative 
and not restrictive. Example methodologies and procedures may be readily understood by 
reference to the Results and Examples described below. Furthermore, all publications referred 
to herein are incorporated in their entirety. 

Subtle modifications may be induced with reference to any suitable target isotope or 
isotopes, which may include, but are in no way limited to, 13 C for 12 C, ls O for 16 0, and I5 N for 
14 N. Additional isotopes suitable for subtle modification in accordance with alternate 
embodiments for the present invention will be readily appreciated by those of skill in the art; for 
instance, deuterium may be swapped for hydrogen. 

1 "X 1 T 

Subtle alteration of the ratio of . C to C may be particularly advantageous in 
connection with the methods of the present invention, insofar as these methods are implemented 
in connection with proteomics. Carbon is the most abundant constituent of proteins, and, thus, a 
small change in the ratio of I3 C to 12 C has the most dramatic effect upon isotopic distribution; 
changing this ratio from 100:1 to 100:2 (or 200:1), for instance, may have quite a dramatic 
effect. In the context of proteomics, this subtle alteration of isotopic ratio can be measured from 
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the isotopic distribution of peptide ions; thereby providing a means of stable isotope tagging that 
does not require full conversion to a non-natural isotope. For example, Figure 1 illustrates a 
MALDI readout from Synechocystis cultures in which subtle modifications of target molecules 
were induced (normal IR, +1.5%, +3.0% and +6%) 13 C, respectively; from top to bottom of 
5 Figure), in accordance with an embodiment of the present invention. 

The methods of the present invention are by no means limited to the study of proteomics, 
however. In fact, the invention may find application in a host of biological systems, as well as ' 
non-biological systems (i.e., in the study of any system in which isotopes may be subtly 
modified and thereafter analyzed by the methods described herein). 

10 Methods for inducing the subtle modifications incorporated in various aspects of the 

present invention will similarly be recognized by and may be readily implemented by those of 
skill in the art without undue experimentation. In addition to the use of variants of the 
technology described above (ie., ICAT, SILAC, enzymatic exchange, and growth in stable 
isotopes), other means for inducing the subtle modification of isotopes can be readily 

1 5 ascertained. For example, a living animal may be fed a diet that includes the non-natural isotope 
or isotopes sought to be introduced into their internal physiology; diet influences actual isotope 
ratios in animals, including humans. Such a diet may include, for example, food (e.g., animal 
chow) supplemented with the non-natural isotope or, in another embodiment of the invention, 
the diet may include deuterated or deuterium-enriched water. Alternatively, the isotope may be 

20 introduced by pharmacological means (e.g., a diet supplement), or by other conventional forms 
of administration (e.g., injection of saline consisting of the non-natural isotope). The particular 
mode of adminstration may be selected based upon the physiological process to be studied. 

Once the target molecule (or molecules) has been subtly modified, it may be studied by a 
number of different technologies. For example, isotope ratio mass spectrometry ("IRMS") may 

25 be employed, whereby levels of different isotopes (e.g., C and C) may be measured after 
conversion to carbon dioxide (e.g., by combustion). Alternatively, the isotope ratio may be 
measured by calculation from the peptide mass spectra obtained by various forms of mass 
spectrometry (e.g., MSMS). Specific protein molecules can be identified by MSMS in 
connection with an appropriate database search, or isotopic distribution can be estimated using 

30 averagine (i.e., a model amino acid with elemental components occurring at frequencies deduced 
from the PIR database). In particular embodiments of the invention, high resolution mass 
spectrometry, such as Fourier transform mass spectrometry ("FTMS"), may be particularly 
advantageous and accurate. Moreover, in instances where peptide sequences (and, thus, 
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elemental compositions) are known, the measurements attainable on high-resolution 
instrumentation, such as mass spectrometers employing FTMS, may be highly accurate. 

The present invention has a range of applications. In one embodiment, the invention 
may be used in connection with isotope coding by subtle alteration of isotope ratio in proteins, 
5 peptides and polypeptides. This may be particularly useful in connection with proteomics. For 
instance, one may study the relative expression of various proteins in a biological system. Two 
(or more) samples can be distinguished by their isotope ratios; thereby allowing mixing and 
relative expression measurement by comparison of peak height/areas (i.e., in a MALDI readout). 
The isotope ratio of a peptide in a mixture is determined by relative contribution from non- 
10 labeled as compared with labeled material. 

In another embodiment, the present invention may be used to study protein turnover 
(e.g., one may monitor metabolic or transcriptional activity by seeking out newly altered or 
transcribed proteins, respectively). Protein turnover may be measured using pulse-chase 
protocols. For instance, if a human begins to eat food with an altered isotope ratio, then this will 
15 first be observed in rapidly turning-over proteins. Slow-turning proteins will be last to manifest 
the change in isotope ratio. 

Among the benefits of employing the methods of the present invention in connection 
with, for example, proteomics, is the significant cost savings and the ability to study systems 
that were heretofore impossible to study by way of isotope swapping. 

20 

13 12 

The C/ C isotope ratio of Synechocystis sp. PCC 6803 grown in culture was altered 
modestly via manipulation of the source of C0 2 for photoautotrophic growth (bicarbonate). 
Soluble proteins were separated from the membrane fraction after mechanical disruption of the 
cells. Membrane proteins were precipitated with acetone, dissolved in formic acid and subjected 

25 to analysis by liquid chromatography electrospray-ionization mass spectrometry with fraction 
collection (LC-MS+). Fractions were reduced, alkylated and digested with trypsin prior to 
MALDI-TOF analysis. Figure 1 compares the isotopic distribution for a peptide derived from 
phycocyanin A from control and the 13 C supplemented cultures. The change in isotopic 
distribution is readily apparent from the spectra and 13 C/ 12 C ratio was calculated from peak 

30 heights and areas using the Isosolv algorithm. The Isosolv algorithm, explained further in 
Example 7, measures carbon isotope distribution. Specifically, the probability of having c n' 
13 C 5 s given C X' total carbons and a 13 C probability of T' is based on the following formula: 
prob(n)=combin(X,n)*Pn* ( 1 -P)(X-n). 
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Given an elemental composition, Isosolv is used to estimate 13 C/ i2 C ratio. For any 
particular molecular weight, the number of carbons is estimated by dividing the molecular 
weight by 1 10 (the average mass of an amino acid) and multiplying by 4.94 (the average number 
of carbons per amino acid). In turn, given a measured isotopic distribution, the C probability 
5 can be determined by calculating the difference between the measured distribution and the 
theoretical distribution for an arbitrary 13 C abundance. The estimated 13 C abundance parameters 
can then be incrementally altered until the error between the theoretical distribution and the 

1 'K 

calculated distribution has been minimized, thereby yielding the C probability in the measured 
spectrum. 

10 Table 1 summarizes the l3 C/ 12 C isotope ratios of peptides derived from their isotopic 

distribution 3 . 
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a The Isosolv algorithm (experimental) was used to calculate 13 C/ 12 C ratio for the peptide 
FLS STELQIAFGR (amino acids 18-30 phycocyanin A; Q54715; elemental composition 
C67 HI 06 N17 O20; calculated M+H* 1468.7794 Da. 

b Targeted 13 C addition. 
5 c Sum of 300-500 laser flashes. 

LCQ-DECA single zoom scan data. 

Use of peak height versus area for calculation of isotope ratio. 
Not available. 

10 There was a general decrease in signal to noise in the higher supplementation samples 

(see Figures 1(c) and 1(d)) with signals appearing at every unit across the mass range. This 
makes identification of the monoisotopic peak problematic and points to the benefits of subtle 
modification of isotope ratio. Figure 2 shows the same peptide from LC-MSMS analysis of 
reduced, alkylated and trypsinized proteins from the Synechocystis sp. samples. For this 

15 experiment, the zoom scan feature of the ion-trap mass spectrometer operating in data-dependent 
acquisition mode (obtained from ThermoEIectron; LCQ-DECA) was used for typical protein 
identification experiments, where ions are excluded from MSMS analysis when a zoom scan (10 
Da width) shows them to be singly charged. Both single and doubly charged ions are shown, 
again displaying the clearly altered isotopic distributions, as in Figure 1 . It should be noted that 

20 at higher 13 C supplementation the isotopic envelope for the single peptide was widened 
considerably, contributing to loss of separation space in the mass spectrometer; again subtle 
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modification of " B C? 12 C ratio appears desirable. The doubly charged ions showed broadly 
comparable isotopic distributions to the singly charged ions with greater experimental variability 
apparent. The 13 C/ 12 C ratios calculated for these spectra are shown in Table 1 and compared to 
the results from MALDI-TOF. In the case of MALDI and zoom scans on singly charged ions by 

5 ion trap, the 13 C/ 12 C ratios calculated are sufficient to distinguish the samples from each other, 
that is, they have been successfully isotope coded. However, the performance was less 
satisfactory for the doubly charged ions. Considerable variability was observed between zoom 
scans on the same peptide (see Table 1) presumably resulting from experimental variability of 
isotopomer capture in the ion-trap experiment. Use of peak area as opposed to height reduced 

10 measurement variability. Increasing the number of scans would improve statistics but is 
undesirable in the context of the proteomics experiment. Similarly, improvements in instrument 
performance with respect to accuracy, resolution and sensitivity will all improve reproducibility. 
Evaluation of next generation of linear ion-traps and Fourier-transform instruments may be 
helpful in this respect. The discrepancy between measured isotope ratio and the theoretically 

1 5 targeted ratio conferred by the bicarbonate is under investigation, but may be due to exchange of 
C0 2 with the atmosphere during the experiment. The cultures cannot be sealed due to the need 
to allow photosynthetically produced 02 to escape. 

It was desirable that the isotope coding strategy be compatible with existing protein 
identification protocols, thus, to that end, the protein identification performance was examined 

20 in the LC-MSMS experiments used for Figure 2 (MSMS on the doubly charged ions only). 

Table 2 summarizes performance of the experiment when interrogated with Sequest for 



protein identifications. Protein identification performance was observed under altered CI C 
ratios 3 . 
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a Protein identification used Sequest (ThermoFinnigan) to match experimental tandem mass 
25 spectrometry data to a database of translated Synechocystis sp. PCC 6803 open reading frames. 
'No enzyme' is selected so that all possible sequences are screened. The results of a 
representative experiment are shown. 
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D The cross-correlation coefficient (Xcorr) provides a measure of how well a tandem mass 
spectrum matches that predicted for a particular peptide. Searches that yield tryptic peptide 
matches with Xcorr >2.3 are generally significant matches. Searches that yield tryptic peptide 
matches with Xcorr >4.0 are nearly always highly significant matches with good signal to noise. 
5 c The delta correlation (ACn) is a measure of how well a number of peptide matches identify a 
specific protein. 

It is noted that the lower 13 C supplemented samples were unaffected with respect to 
peptides/proteins identified while a noticeable decline was observed for the highest C sample. 

10 The widening of the isotopic envelope may lead to less frequent triggering of the MSMS 
experiment as maximum signal intensities drop. Also MSMS spectra may fail to yield Sequest 
hits as higher mass b and y fragments become too large to fall within tolerance limits for 
database matching. In Figure 3, the MSMS spectra of the same 1468 Da peptide from 
Phycocyanin A (doubly charged parent) are compared for the control and the 3.0% 

15 supplementation samples. It is noted that while Sequest returned strong correct identifications 
for both peptides, the dominant larger b and y fragments in the 3.0% sample exhibit the mass of 
the first 13 C isotope rather than the monoisotopic mass. It is concluded that Sequest remains 
efficient until the monoisotopic species is sufficiently diminished in abundance at the highest 
13 C supplementation. Furthermore, it is concluded that the standard performance of Sequest 

20 tolerates subtle modification of isotope ratio in ranges that are useful for isotopic coding (+0.75- 
1.5% supplementation; 750-1500%). 

Modification of 13 C/ 12 C was chosen because carbon is the most abundant element in 
peptides and proteins, though the strategy could also employ 15 N: 14 N or 18 0: 16 0 manipulation as 
discussed above. Cargile (Cargile et al 9 "Synthesis/degradation ratio mass spectrometry for 

25 measuring relative dynamic protein turnover," Analytical Chemistry, Vol. 76, pp. 86-97 (2004)) 
used pulse labeling with 13 C to measure protein turnover kinetics although the use of high atom 
percentages of 13 C lead to dramatically extended isotope distributions that in the proteomics 
context would result in dramatic loss of separation space and, as is apparent in the Figures 
presented, the appearance of peaks at every unit across the mass spectrum. The data presented in 

30 Figure 1 and Figure 2 show similar features when the isotope ratio is altered too dramatically 
and emphasize the benefits of subtle modification of the isotope ratio. Based upon the data 
presented, it is suggested that doubling the 13 C/ 12 C ratio provides adequate isotope coding with 

13 

minimal extension of the isotopomer distribution. Others have pointed out the benefits of C 
depletion (Marshall et aL, "Protein molecular weight to 1 Da by 13 C, 15 N double-depletion and 
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FT-ICR mass spectrometry," Journal of the American Chemical Society, Vol. 119, pp. 433-434 
(1997); Pasa-Tolic et al, 1999) and this strategy may be beneficial in the context of SMIRP. 

For SMIRP to be useful in the context of expression proteomics it is necessary to control 
a number of variables such that significant changes in relative expression can be measured with 
5 quantifiable error. As the results show, there is significant variability associated with using 
relative isotopomer abundance for calculation of isotope ratio. While not wishing to be bound by 
any particular theory, it is believed that the origin of this variability is due to variability of 
relative isotopomer measurement by the mass spectrometer. Variability among different peptides 
as a result of their specific elemental compositions combined with metabolic bias, and natural 

1 0 variability of specific peptide isotope ratio based upon changes in flux that alter metabolic bias 
might also be significant in the context of proteomics. Measurement variability can be addressed 
by increased sampling and the use of mass spectrometers with improved sensitivity and 
resolution while the later natural variability must be explored in detail in future research. Thus, 
natural variability may provide additional information with respect to cellular flux in the future. 

1 5 SMIRP technology can be applied to any conceivable proteomics experiment including 2D gels, 
MuDPIT (Washburn et al, "Large-scale analysis of the yeast proteome by multidimensional 
protein identification technology," Nature Biotechnology, Vol. 19, pp. 242-247 (2001)), 
accurate mass and time tags (Strittmatter et a/., "Proteome analyses using accurate mass and 
elution time peptide tags with capillary LC time-of-flight mass spectrometry," Journal of the 

20 American Society for Mass Spectrometry, Vol. 14, pp. 980-991 (2003)) and SILAC (Ong et al, 
"Stable isotope labeling by amino acids in cell culture, SILAC, as a simple and accurate 
approach to expression proteomics," Molecular and Cellular Proteomics, Vol. 1, pp. 376-386 
(2002)). 

25 EXAMPLES 

The following examples illustrate a method of performing subtle isotope modification 
and monitoring peptide turnover. Modifications of these examples will be readily apparent to 
those skilled in the art seeking to perform isotope modification for expression proteomics or 
30 other applications differing from those described herein. These examples are included merely 
for purposes of illustration. 
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EXAMPLE 1 
CULTURE OF CELLS 

Synechocystis sp. PCC 6803 cells were grown autotrophically in liquid BG-11 medium 
(available from Sigma; St. Louis, MO) (Rippka et al, "Generic assignments, strain histories and 

* 

5 properties of pure cultures of cyanobacteria," Journal of General Microbiology, Vol. 1 1 1, pp. 1- 
61 (1979)) buffered with 5 mMN-tris-hydroxymethyl-2-aminoethaiiesulfonic acid (TES)-NaOH 
(pH 8.0) (available from Sigma) and supplemented with filter-sterilized mixture of NaH 13 C0 2 
(obtained from Cambridge Isotope Laboratories; 99% 13 C; Andover, MA) and NaH 12 C0 2 
(obtained from Sigma, 1.1% of i3 C). The bicarbonate mixture was added to the freshly 

10 autoclaved BG-11 medium, which contained essentially no dissolved C0 2 , and the bottles with 
the medium were sealed until further usage. The added NaH 13 C03 was calculated to be 0%, 
1.5%, 3.0%, or 6.0% of the total NaHCQ 3 taking into account natural i3 C abundance (~1 13 C/100 
12 C) of the standard BG-llmedium (20 mg/1). Cultures were started by inoculating 50 ml of the 
medium (in a 150 ml flask) with a small amount of cells followed by incubation at 28 °C with 

15 light intensity of 50 mol photons/m 2 s while shaking at 100 rpm on a rotary shaker. Every 2-3 
days the cultures were transferred to larger flasks and diluted with fresh BG-11 medium 
containing an appropriate amount of NaH CO3. The cultivation continued until the cell cultures 
reached OD730-O.5 in a total volume of 500 ml. After that, cells were harvested by 
centrifugation, washed with thylakoid buffer (50 mM MES-NaOH at pH 7.0, 5 mM CaCl 2 , 5 

20 mM MgCl 2 , 10 mM NaCl, 15% v/v glycerol, and 0.5% v/v DMSO), and then frozen in liquid 
nitrogen. 



EXAMPLE 2 
PROTEIN PREPARATION 

25 Cells were thawed rapidly and placed on ice. Protease inhibitors (obtained from Sigma; 

P8465; 50 1/1 ml aliquot) were added prior to transfer of the cell suspension to tubes containing 
glass beads (0.1 mm; 1.0-1.2 g) pre-cooled on ice. Cells were broken using a micro-beadbeater 
(obtained from Biospec Products; Bartlesville, OK) on its maximum setting (4-5 x 30 s). Cells 
were cooled on ice between each treatment. Cell breakage efficiency was assessed by extraction 

30 of cells in acetone (10 jul cells plus 1 ml 80% acetone), agitation and centrifugation; chlorophyll 
was only extracted after cell breakage yielding a blue pellet. The broken cell suspension was 
diluted 10-fold with ice cold thylakoid buffer containing protease inhibitors and decanted to pre- 
cooled centrifuge tubes. Unbroken cells were removed (500 rpm SS34; 1 min) prior to transfer 
to clean tubes and sedimentation of the membranes (20,000 rpm SS34; 30 min). The supernatant 
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was retained for soluble proteins and the pellet was resuspended in thylakoid buffer, 
homogenized (Teflon/glass) and stored at -80 °C. 



EXAMPLE 3 

5 LIQUID CHROMATOGRAPHY ELECTROSPRAY-IONIZATION MASS 

SPECTROMETRY WITH FRACTION COLLECTION 

Samples of Synechocystis membranes were analyzed by LCMS+ (Whitelegge et at, 
2002). Membrane fraction proteins (300-600 g) were precipitated at the interface of an aqueous 
chloroform/methanol phase separation (Wessel and Flugge, "A method for the quantitative 
10 recovery of protein in dilute solution in the presence of detergents and lipids./ 5 Analytical 
Biochemistry, Vol. 138, pp. 141-143 (1984)) as described (Whitelegge et al, "Toward the 
bilayer proteome, electrospray ionization-mass spectrometry of large, intact transmembrane 
proteins," Proceedings of the National Academy of Sciences of the United States of America, 
Session 96, pp. 10695-10698 (1999)). Precipitated proteins were recovered after removal of the 

15 aqueous phase and addition of methanol. Precipitated samples were dried at atmospheric 

i 

pressure for 2 min (25 °C) and dissolved in 90% formic acid (available from Sigma; 100 |nl) 
immediately prior to HPLC. Reverse phase chromatography (RPC) of intact proteins was 
performed as described previously (Whitelegge et al, 2002; Whitelegge, "Thylakoid membrane 
proteomics," Photosynthesis Research, Vol. 78, pp. 265-277 (2003); and Whitelegge, "HPLC 

20 and mass spectrometry of intrinsic membrane proteins," Methods in Molecular Biology, Vol. 
251, pp. 323-340 (2004)) using a macroporous polymeric support (obtained Polymer Labs; 
Amherst, MA; PLRP/S, 300 A, 5 m, 2 x 150 mm) at 100 jil/min (40 °C). The column was 
previously equilibrated in 95% A, 5% B (A, 0.1% TFA in water; B, 0.05% TFA in 
acetonitrile/isopropanol, 1:1, v/v) and eluted with a compound linear gradient from 5% B at 5 

25 min after injection, through 40% B at 30 minutes and to 100% B at 150 min. The eluent was 
passed through a UV detector (280 nm) prior to a liquid-flow splitter (inserted between HPLC 
detector and mass spectrometer) that made it possible to collect fractions concomitant with 
electrospray-ionization mass spectrometry (ESI-MS). Fused silica capillary was used to transfer 
liquid to the ESI source (-50 cm) or fraction collector (-25 cm). The split fractions were 

30 collected into micro-centrifuge tubes at 1 min intervals. ESI-MS was performed as described 
(Whitelegge et al, "Electrosprayionization mass spectrometry of intact intrinsic membrane 
proteins," Protein Science, Vol. 7, pp. 1423-1430 (1998)) using a triple quadruple instrument 
(obtained from Applied Biosystems; Foster City, CA; API HI). Orifice voltage was ramped from 
60 to 120 over the mass range acquired (600-2300) and the instrument scanned with a step size 
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of 0.3 amu and 1 ms dwell. Data were processed using MacSpec 3.3 5 Hypermass or 

BioMultiview 1.3.1 software (obtained from Applied Biosystems). 

i 

EXAMPLE 4 

5 TRYPSIN DIGESTION 

Selected fractions collected during LCMS+ were reduced, alkylated and treated with 
trypsin (obtained from Promega; Madison, WI; sequencing grade modified by reductive 
methylation). DTT (15 |ul, 10 mM in 50 mM ammonium bicarbonate; 30 min, 24 °C) then 
iodoacetamide (15 pi, 55 mM in 50 mM ammonium bicarbonate; 20 min, 24 °C) and finally 
10 trypsin (12.5 jil, 6 ng/1 in 50 mM ammonium bicarbonate; 3 h, 37 °C) was added to aliquots of 
fractions (10 After incubation, samples were dried by centrifugal evaporation and stored at 
-20 °C prior to analysis by LC-MSMS. 



EXAMPLE 5 

1 5 MASS SPECTROMETRY BY MALDI-TOF (MATRIX-ASSISTED LASER 

DESORPTION IONIZATION TIME-OF-FLIGHT) MASS SPECTROMETRY 

Dried reaction mixtures were re-dissolved in 5 \il of 70% acetic acid and analyzed (0.2 \il 
plus 0.5 |Ltl matrix) by matrix-assisted laser desorption ionization (MALDI) coupled to delayed 
extraction time-of-flight MS in the reflector mode (obtained from Applied Biosystems; Voyager 
20 DE STR) using -cyano-4-hydroxycinnamic acid as matrix (10 mg/ml solution in 
water/acetonitrile/TFA 30/70/0.1) and internal/external calibration with bovine insulin. 
Manufacturer supplied default settings optimized for peptides less than 6000 Da were used for 
all samples. 



25 EXAMPLE 6 

LC-MSMS (MICRO-LIQUID CHROMATOGRAPHY WITH TANDEM MASS 

SPECTROMETRY) 

Samples were analyzed by LC-MSMS with data-dependent acquisition (obtained from 
ThermoFinnigan; San Jose, CA; LCQ-DECA) after dissolution in 5 \xl of 70% acetic acid (v/v). 
30 A reverse phase column (obtained from Michrom Biosciences, San Jose, CA; 200 m x 10 cm; 
PLRP/S 5 m, 300 A) was equilibrated for 10 min at L5 |xl/min with 95% A, 5% B (A, 0.1% 
formic acid in water; B, 0.1% formic acid in acetonitrile) prior to sample injection, A linear 
gradient was initiated 10 min after sample injection ramping to 60% A, 40% B after 50 min and 
20% A, 80% B after 65 min. Column eluent was directed to a coated glass electrospray emitter 
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(TaperTip, TT150-50-50-CE-5, New Objective) at 3.3 kV for ionization without nebulizer gas. 
The mass spectrometer was operated in 'triple-play 1 mode with a survey scan (400-1500 m/z), 
data-dependent zoom scan and MSMS. Individual sequencing experiments were matched to a 
custom Synechocystis sequence database using Sequest software (obtained from 
5 ThermoFinnigan) . 



EXAMPLE 7 

CALCULATION OF ISOTOPE RATIO FROM PEPTIDE DISTRIBUTION (ISOSOLV) 

For a carbon isotope distribution the probability of having e n' 13 C's given 'X' total 
10 carbons and a 13 C probability of 'P' is described as follows: 

prob^-combKX^^Pn^l-PXX-n) 

When given an elemental composition Isosolv uses this for estimation of C/ C ratio. When 
1 5 given a molecular weight, the number of carbons is estimated by dividing the molecular weight 
by 1 10 (the average mass of an amino acid) and multiplying by 4.94 (the average number of 
carbons per amino acid). Then, given a measured isotopic distribution, the I3 C probability can be 
determined by calculating the difference between the measured distribution and the theoretical 
distribution for an arbitrary l3 C abundance. The estimated 13 C abundance parameters are then 
20 incrementally altered until the error between the theoretical distribution and the calculated 
distribution has been minimized thus yielding the l3 C probability in the measured spectrum. The 
version of Isosolv used in these examples includes natural minor contributions of D, 15 N, 17 / ls O 
only. 

25 While the description above refers to particular embodiments of the present invention, it 

will be understood that many modifications may be made without departing from the spirit 
thereof. The accompanying claims are intended to cover such modifications as would fall within 
the true scope and spirit of the present invention. The presently disclosed embodiments are 
therefore to be considered in all respects as illustrative and not restrictive, the scope of the 

30 invention being indicated by the appended claims, rather than the foregoing description, and all 
changes that come within the meaning and range of equivalency of the claims are therefore 
intended to be embraced therein. 




